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Abstract. We present a detailed X-ray spectral analysis of the sources in the IMs catalog 
of the Chandra Deep Field South (CDFS) taking advantage of optical spectroscopy and 
photometric redshifts for 321 extragalactic sources out of the total sample of 347 sources. 
As a default spectral model, we adopt a power law with slope F with an intrinsic redshifted 
absorption A'^, a fixed Galactic absorption and an unresolved Fe emission line. For 82 X- 
ray bright sources, we are able to perform the X-ray spectral analysis leaving both F and 
N/j free. The weighted mean value for the slope of the power law is (F> =^ 1.75 ± 0.02, and 
the distribution of best fit values shows an intrinsic dispersion of cr„„ ^ 0.30. We do not 
find hints of a correlation between the spectral index F and the intrinsic absorption column 
density A'^. 

We then investigate the absorption distribution for the whole sample, deriving the Nh 
values in faint sources by fixing F = 1.8. We also allow for the presence of a scattered 
component at soft energies with the same slope of the main power law, and for a pure 



2 P. Tozzi et al.: X-ray spectral properties of AGN in the CDFS 

reflection spectrum typical of Compton-thick AGN. We detect the presence of a scattered 
soft component in 8 sources; we also identify 14 sources showing a reflection-dominated 
spectrum. The latter are referred to as Compton-thick AGN candidates. 

By correcting for both incompleteness and sampUng-volume eff'ects, we recover the 
intrinsic Nh distribution representative of the whole AGN population, f{NH)dNH, from 
the observed one. fiNn) shows a lognormal shape, peaking around log{NH) - 23.1 and 
with cr - 1.1. Interestingly, such a distribution shows continuity between the population of 
Compton-thin and that of Compton-thick AGN. 

We find that the fraction of absorbed sources (with Nh > 10^^ cm"^) in the sample 
is constant (at the level of about 75%) or moderately increasing with redshift. Finally, we 
compare the optical classification to the X-ray spectral properties, confirming that the cor- 
respondence of unabsorbed (absorbed) X-ray sources to optical Type I (Type II) AGN is 
accurate for at least 80% of the sources with specttal identification (1/3 of the total X-ray 
sample). 

Key words. X-rays: diffuse background - surveys - cosmology: observations - X-rays: 
galaxies - galaxies: active 

1. Introduction 

Deep X-ray surveys with Chandra (Brandt et al. 2001; Rosati et al. 2002; Cowie et al. 2002; 
Alexander et al. 2003; Barger et al. 2003) and XMM (Hasinger et al. 2001) showed that the so 
called X-ray background (XRB) is mainly provided by Active Galactic Nuclei (AGN) both in the 
soft (0.5-2 keV) and in the hard (2-10 keV) band. In particular, major progress has been made in 
the hard band, for which the sources known before Chandra were providing only ~ 30% of the 
XRB (Cagnoni et al. 1998; Ueda et al. 1999a). While some evidence of spectral hardening was 
found towards faint fluxes (e.g. Delia Ceca et al. 1999), most of the X-ray sources were identified 
with Broad Line AGN with a typical X-ray spectral slope of F = 1.7 - 2.0, steeper than that of 
the XRB (F ^ 1.4). On the contrary, the source population discovered by Chandra and XMM at 
fluxes below ~ lO'^^ - 10"" erg cm ^ s Mn the hard band, is constituted mostly by obscured 
AGN with a hard spectrum, and provides the solution to the "spectral paradox" as predicted by 
the XRB synthesis models (Setti & Woltjer 1989, Madau, GhiseUini & Fabian 1993; Comastri, 
Setti, Zamorani & Hasinger 1995; GiUi, Salvati & Hasinger 2001). The detection Umit reached 
in the hard band in the 2Ms exposure of the Chandra Deep Field North is 5 ^2x10"^* erg s"' 
cm"^ (Alexander et al. 2003) and a factor 2 higher in the IMs exposure of the Chandra Deep Field 
South (CDFS, Rosati et al. 2002; Giacconi et al. 2002). The XRB is now resolved at the level of 
~ 80% in the 1-2 and 2-8 keV bands (see Hickox & Markevitch 2005), with the AGN providing 
the large majority of the resolved fraction. While a non-negUgible part of the unresolved fraction 
in the soft band is expected to be contributed by a diffuse warm intergalactic medium (e.g., Cen 
& Ostriker 1999), Worsley et al. (2004; 2005) pointed out that at £ > 6 keV less and less of 
the hard XRB is resolved, showing that a significant population of strongly absorbed, possibly 
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Compton-thick sources, preferentially at z < 1, is still not observed (see also Comastri 2004; 
Brandt & Hasinger 2005). 

The two Chandra Deep Field Surveys lead to the detection of several populations of X- 
ray extragalactic sources: unabsorbed AGN (defined as sources with absorbing column densities 
Nh < 10^^ cm"^), usually identified with optical Broad Line (Type I) AGN and QSO; absorbed 
AGN (with column densities A^^ > 10^^ cm~^), optically identified mostiy as narrow line (Type 
II) AGN, distributed around moderate redshifts z ~ 1 (see Barger et al. 2002; Szokoly et al. 2004); 
X-ray bright, optically normal galaxies (XBONG, see Comastri et al. 2001) which generally 
harbor obscured AGNs; high redshift Type II QSO (see Norman et al. 2002; Stem et al. 2002; 
Mainieri et al. 2005a; Rak et al. 2005); starburst and quiescent galaxies at z < 1 (Bauer et al. 
2002; Homschemeier et al. 2003; Norman et al. 2004), which contiibute to the XRB only 2 - 3% 
in energy, but they are expected to outnumber the AGN at fluxes below 1 x 10"^^ erg cm"-^ s"^ 
(Bauer et al. 2004a). In this Paper we wiU focus on the X-ray properties of the AGN population, 
in order to provide a basehne for possible models of the AGN formation and evolution. 

Tentatively, the difl'erent classes of AGN-powered X-ray sources can be associated to three 
phases: a first phase of strong accretion onto the massive black hole, characterized by high intrin- 
sic absorption and intense star formation (for recent evidence in the submm range see Alexander 
et al. 2005a), followed by an unobscured phase, and subsequent fading (see Fabian 1999; Granato 
et al. 2004). A test of this or other possible scenarios for the accretion history and galaxy for- 
mation in the Universe, requires a good knowledge of the distribution of the X-ray properties 
of the AGN population, in particular intrinsic luminosity and intrinsic absorption as a function 
of cosmic epoch, as well as their relation with the optical properties. The distribution of the in- 
trinsic absorption, Nh, is known only for local, opticaUy selected Seyfert II galaxies (Risahti et 
al. 1999). These local samples, selected to be complete as a function of intrinsic luminosity, typ- 
icaUy include medium or low luminosity sources, and about 50% of them are Compton-thick. 
Difiiculty of assembling large unbiased AGN sample as a function of intrinsic luminosity, has 
hampered attempts to measure the A^^ distribution. The Nh distribution and the evolution of the 
fraction of absorbed sources, has been investigated recently by Ueda et al. (2003) from a combi- 
nation of surveys from HEAOl, ASCA and Chandra. Their sample is dominated by bright, low 
absorption AGN, and their Nh distribution is broadly peaked above A^^ > 10^^ cm"^. Except for 
a few objects with good photon statistics, Ueda et al. use the redshift and the hardness ratio to 
derive the intrinsic luminosity distribution in the 2-10 keV band as a function of redshift, without 
performing a single-source analysis. Similar results have been recentiy obtained by La Franca et 
al. (2005) on the basis of the HELLAS2XMM sample combined with other catalogs. At brighter 
fluxes, other investigations are under way both with Chandra and XMM in wide, shaUower sur- 
veys (ChaMP, Green et al. 2004; Silverman et al. 2005; XMM-BSS, Delia Ceca et al. 2004; 
CLASXS, Yang et al. 2004, Steffen et al. 2004; HELLAS2XMM, Baldi et al. 2002; Perola et al. 
2004). We beheve that these X-ray surveys, designed to bridge the gap between the pencil beam, 
deepest surveys and the wide shallow ones from previous missions, are probably biased against 
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heavily absorbed faint AGN, whose fraction is expected to increase towards fainter fluxes. On 
the other hand, optical surveys can actuaUy discover heavily obscured AGNs at moderate redshift 
(z < 1.3) but only through extensive optical spectroscopy of large sample of galaxies, such as 
the SDSS, among which type U AGNs can be identified from the strong narrow emission lines 
(for example, [On]/l3727A or [Oin]/15007A). In the absence of high-sensitivity X-ray surveys 
above 10 keV, we propose that the search for the stiU missing strongly absorbed AGN population 
can be best performed through a detailed spectral analysis of faint sources detected in very deep 
X-ray surveys. 

In this Paper, we present a systematic study of the X-ray spectra of all the sources in the 
CDFS, taking advantage of spectroscopic (Szokoly et al. 2004) and photometric (Zheng et al. 
2004; Mainieri et al. 2005a) redshifts from the optical foUow-up program with the ESO-VLT. 
Given the flux limits in the CDFS (5.5 X 10"" and 4.5 X 10"^^ erg cm"^ s"^ in the soft and hard 
band respectively), the 347 sources detected (346 from the catalog of Giacconi et al. 2002 plus 
one added in Szokoly et al. 2004) are mostly AGN, with a fewer number of normal or star forming 
galaxies with respect to CDFN, where, thanks to the lower flux limits, normal galaxies start to be 
a significant fraction of the faint source population. The Paper is structured as follows. In §2 we 
briefly describe the X-ray and the Optical data. In §3 we describe our X-ray spectral analysis 
procedure, after dividing the sample into two subsamples based on the counts statistics. In §4 
we present the X-ray spectral analysis of the X-ray bright sample, focusing on the slope of the 
power law component. In §5 we present the X-ray spectral analysis for the whole sample of 321 
sources with measured redshift and total luminosity Lx > 10"*^ erg s^ (we exclude the faintest 
luminosity bin which is doninated by normal galaxies), focusing on the intrinsic absorption. In §6 
we discuss the distributions of the X-ray spectral properties after correcting for incompleteness 
and sampling-volume effects, deriving in particular the intrinsic absorption distribution. This 
aUows us to estimate the fraction of absorbed sources in our sample as a function of epoch. 
Finally, in §7 we compare the X-ray and optical properties, revisiting the comparison of the 
Optical vs X-ray classification scheme proposed by Szokoly et al. (2004). Our conclusions are 
summarized in §8. Luminosities are quoted for a flat cosmology with A = 0.7 and Hq = 70 
km/s/Mpc (see Spergel et al. 2003). 

2. The data 

The IMs dataset of the CDFS is the result of the coaddition of 1 1 individual Chandra ACIS-I 
(Garmire et al. 1992; Bautz et al. 1998) exposures with aimpoints only a few arcsec from each 
other. The nominal aim point of the CDFS is a =3:32:28.0, 8 = -27:48:30 (J2000). The reduc- 
tion and analysis of the X-ray data are described in Giacconi et al. (2001), Tozzi et al. (2001) and 
Rosati et al. (2002). The final image covers 0.108 deg^, where 347 X-ray sources are identified 
(the catalog is presented in Giacconi et al. 2002). Here we use an updated X-ray data reduction, 
where we used Ciao 3.0.1 and CALDB2.26, therefore including the correction for the degraded 
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effective area of ACIS-I chips due to material accumulated on the ACIS optical blocking fil- 
ter at the epoch of the observation. We also apply the recently released, time-dependent gain 
correction'. 

We briefly recall the main steps of the spectral analysis of the reduced data. First we extract 
the photon files and the spectrum (pha file) for every source in our catalog, along with the cor- 
responding background. The area of extraction of each source, as described in Giacconi et al. 
(2001), is defined as a circle of radius = 2.4 x FWHM (with a minimum radius of 5 arcsec). 
The FWHM is modeled as a function of the off-axis angle to reproduce the broadening of the 
PSF. The background is extracted from an annulus with outer radius Rs + 12" and an inner radius 
of Rs + 2", after masking out other sources. Each background spectrum samples more than 400 
photons in the 0.5-7 keV range. We create a response matrix and an ancillary response matrix 
for each source. To do that, we first create the two matrices in the source position in each of the 
1 1 observations of the CDFS (therefore the effect of the degraded effective area of ACIS-I chips 
is applied individually to each pointing). Finally we sum the 1 1 files weighting them for the ex- 
posure time of each exposure. We notice that most of the sources show variability (see PaoUUo et 
al. 2004), therefore our measured fluxes and luminosities are time-averaged on the observation 
epochs. We also stress that, assuming there is no significant changes in the spectra, we correctly 
measure the spectral shape of each source, since the response matrices are time-averaged on the 
same epochs, keeping track in the most detailed way of the characteristics of the different regions 
and the different conditions of the detector at the time of the observations. 

The spectroscopic identification program carried out with the ESO-VLT is presented in 
Szokoly et al. (2004). The optical classification is based on the detection of high ionization 
emission lines. The presence of broad emission lines (FWHM larger than 2000 km/s) like Mg//, 
C///, and, at large redshifts, C/y and Lya, identifies the source as a Broad Line AGN (BLAGN), 
Type-1 AGN or QSO in the simple unification model (Antonucci 1993). The presence of un- 
resolved high ionization emission lines (like O///, Ney, Ne/// or He//) identifies the source as a 
High Excitation line galaxy (HEX), often implying an optical Type-2 classification. Objects with 
unresolved emission lines consistent with an H// region spectrum are classified as Low Excitation 
Line galaxies (LEX), implying sources without signs of nuclear activity in the optical (however, 
discriminating between a Seyfert II galaxy and an HII region galaxy involves the measure of 
line ratio as shown in Veilleux & Osterbrock (1987), which is not used here as a classification 
scheme, considering also that their classification scheme relies on lines which are not visible in 
optical spectra from the ground at z > 0.7). Objects with typical galaxy spectrum showing only 
absorption lines are classified as ABS; among the last two classes we expect to find star-forming 
galaxies or Narrow Line Emission Galaxies, but also hidden AGN. The optical identification is 
flagged according to the quality of the optical information. Quality flags Q > I indicates spec- 
troscopic redshifts (see Table In several cases, the optical spectral properties do not allow 



' see _http://asc.harvard.edu/ciao/threads/acistimegaiii/| 
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us to obtain a secure determination of the spectral type. As shown in Szokoly et al. (2004), the 
optical classification scheme is failing in identifying an AGN in about 40% of the X-ray sources 
optically classified as LEX or ABS. Therefore, an X-ray classification scheme, based on the 
source hardness ratio and observed X-ray luminosity, was worked out by Szokoly et al. (2004) 
and compared with the optical one (see their Fig. 13). In Section §7 we will reconsider this X-ray 
classification scheme using the intrinsic luminosities (as opposed to observed ones) and intrinsic 
absorption (as opposed to the hardness ratio). 

Optical and near-IR images of the CDFS are also used to derive photometric redshifts for 
all the remaining X-ray sources. Using the widest multiwavelength photometry available today, 
Zheng et al. (2004) and Mainieri et al. (2005a) derived photo-z for the whole sample of sources 
but four. Photometric redshifts are obtained from dififerent methods labelled with diff^erent quality 
flags (see Zheng et al. 2004 for details). When we have consistent redshift from more than one 
method, the corresponding quality flag is the sum of the single Q (always less than 1 for pho- 
tometric redshift). Given the good agreement of photometric redshifts with spectroscopic ones 
(see Zheng et al. 2004), we do not divide our sample according to the optical spectra quality. 
Indeed, our statistical analysis is not expected to be significantly afifected by uncertainties in the 
photometric redshifts. Uncertainties in the redshift estimate may instead significantly affect the 
search for the Fe line, as we discuss later. 

The total number of sources with spectral or photometric redshift z > is 336 over a total 
of 347 X-ray detections. Besides the 4 X-ray sources without any redshift estimate, we indeed 
identify 7 stars with good optical spectra. Therefore the spectral completeness of our sample 
of extragalactic sources is ~ 99 %. Since we want to focus on AGN, we adopt a conservative 
criterion and exclude 15 sources with total luminosity in the 0.5-10 keV band Lx < 10**' erg 
s a luminosity range which is expected to be dominated by normal or star forming galaxies. 
We note that the higher luminosiy range 10"*' < Lx < 10"*^ erg s"' may include several star 
forming galaxies as well, with star formation rate of the order of lOOMo/yr. However, we keep 
all the sources in the luminosiy range 10"*' < Lx < lO'*^ erg s ' to include any possible low- 
luminosity AGN in the sample. The final sample amounts to 321 sources. The redshifts with the 
corresponding spectral quality are shown along with the results from the X-ray spectral fits in 
TablelH 

3. The X-ray spectral analysis 

3.1. Fitting strategy 

We use XSPEC vl 1.3.1 (see Arnaud 1996) to perform the spectral fits. The ability of obtaining 
a reliable fit depends on the X-ray spectral quality, or, in simpler terms, on the signal to noise 
of the spectrum under analysis. The distribution of the net counts in the 0.5-7 keV band for all 
the sources in our sample, peaks below ^ 100 (see Figure[2l. The mean value of the net detected 
counts in the total 0.5-7 keV band for all the sources in our sample (including the two X-ray 
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brightest sources in the sample, with about 10000 counts each) is - 240 counts, while the median 
is much lower ^ 70 counts. 

Therefore, the strategy for the X-ray spectral analysis must be appropriate for the low counts 
regime. In performing the spectral fits we used an extension of the Cash statistics which makes 
use of both the source and background spectral files^. Cash statistics is applied to unbinned data, 
and therefore exploit the full spectral resolution of the ACIS-I instrument, allowing better per- 
formance with respect to the canonical analysis, particularly for low signal-to-noise spectra 
(Nousek & Shue 1989). In order to assess the ability of our fitting procedure in a typical case (a 
source with F = 1.7 and Nh - 5 x 10^^ cm"^ at z = 1) we run several simulations for different 
input fluxes, in which we try to recover the input parameters with two different fitting procedures: 
Cash statistics (unbinned) and the classic statistics with a binning of 10 photons per bin. The 
results are summarized in Figure |2] Note that we are forced to use a binning of 10 photons (as 
opposed to the commonly used binning of 20 photons) in order have a reasonable number of bins 
to perform the fits in the low-counts regime. Such a small binning is known to give inappro- 
priate weights for the analysis, therefore we do not mean to present a detailed comparison of 
the two methods. Indeed, here we just explore the effects that their use would have in the spectral 
analysis of our sample. For the;^'^ statistics, we find that for sources with a number of net counts 
equal or larger than 50, the input parameters are recovered with very good accuracy, while for 
lower values, the peak of the distribution of the best-fit-values starts to depart from the input 
value. The shift in the distribution of the best-fit values is a consequence of the binning, which, 
especially in the case of low-counts statistics, acts as an effective smoothing on the spectrum. 
On the other hand, the distribution of the best-fit values with Cash-statistics appears to be closer 
to the input values. In addition, the rms dispersion of best-fit values is significantly lower with 
respect to the;^'^ statistics. We also checked that the confidence levels for the Cash-statistics can 
be defined as in the ;t'^-statistics (i.e., AC = 1.0 corresponds to 1 cr, AC - 2.7 corresponds to 
90% c.l. for one interesting parameter). Therefore we choose to quote only the best fit values 
obtained with the Cash statistics. 

Of course, the weak signal of our faintest sources limits the ability to perform a fit keeping all 
the spectral parameters free. To determine the validity of our approach, we first run the fit for our 
default model with three free parameters (Nh, F and normalization) on all the sources with more 
than 40 net detected counts in the total 0.5-7 keV band^. First we focus on the distribution of the 
best-fit values for F as a function of the net counts (see Figure|3] left). We notice that for sources 
detected with a large number of counts (larger than ^ 200) the spectral slope is almost constant. 
On the other hand, at low counts, the best fit spectral slope F shows an apparent trend associated 
with a significant increase in the dispersion on F (see Figure|3j right); lower values at lower soft 

^ see'http : / /heasar c . gsf c . nasa . gov/docs/xanadu/xspec/manual/node 5 7 . html 
^ Given the low background of Chandra and the small extraction regions used for the sources, the corre- 
lation between signal-to-noise in a given band and total net counts is very tight. Therefore for simplicity 
we select our sources on the basis of the net detected counts. 
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counts, higher values at lower hard counts. In principle this is expected, since most of the sources 
with few soft counts are among the hardest sources, and they can be fitted with a flat power law, 
and viceversa the softest sources can be fitted with a very steep power law. However, we argue 
that this behaviour may be afi'ected by the poor statistics. To avoid any possible bias induced by 
the low statistics, we conservatively define an X-ray bright sample by considering those sources 
exceeding at least one of these thesholds: 170 total counts, 120 soft counts, 80 hard counts. As 
we can see in Figure (Sj), the threshold on the soft counts is particularly eflicient in selecting 
sources for which the statistical error on F is smaller than 20% (about 10% in average). The 
bright sample, constituted by 82 sources, will be used to investigate both the intrinsic spectral 
slope F and the intrinsic absorption Nh- We remark here that the bright sources are selected on the 
basis of the net detected counts, and not on the basis of the energy flux; among the brigth sample, 
we find sources with fluxes larger than 4 x 10"'^ erg s"' cm"^ in the soft and 1.3 x 10"'^ erg s"' 
cm"^ in the hard band. As for the remaining 3/4 of the sample, we decide to fix the slope to the 
canonical value of F ^ 1.8 (see Turner et al. 1997), which is, in turn, very close to the average 
value measured for our bright sample (as shown in §4), and focus on the intrinsic absorption. 

3.2. Spectral models 

We assume a default spectral model based on a power law (XSPEC model pow) and intrinsic 
absorption at the source redshift (XSPEC model zwabs) with redshift frozen to the spectroscopic 
or photometric value. Also, we search for the Fe Ka line at 6.4 keV rest-frame, which is one of 
the most common features of AGN X-ray spectra. To investigate the presence of such a line, 
we added a redshifted unresolved Gaussian line at 6.4/(1 + z) keV (Nandra & Pounds 1994). 
We also take into account the local Galactic absorption (XSPEC model tbabs) with a column 
density frozen to Nh = 8x10'^ cm"^ (from Dickey & Lockman 1990). The fits are performed on 
the energy range 0.6-7 keV. We cut below 0.6 keV to avoid uncertainties in the ACIS calibration 
in an energy range which anyway off'ers a small effective area. At high energies, the efficiency 
of Chandra is rapidly decreasing, and the energy bins at more than 7 keV are dominated by the 
noise for the large majority of the sources in our flux range. It has recently been shown that 
a methylen layer on the Chandra mirrors increases the effective area at energies larger than 2 
keV (see Marshall et al. 2003)^. This has a small effect on the total measured ffuxes, but it can 
have a non-negligible effect on the spectral parameters. To correct for this, we include in the 
fitting model a "positive absorption edge" (XSPEC model edge) at an energy of 2.07 keV and 
with T = -0.17 (Vikhlinin et al. 2005). This multiplicative component artificially increases the 
hard fluxes by ^ 3.5%, therefore the final hard fluxes and luminosities computed from the fit are 
corrected downwards by the same amount. 

In some cases, the fit with a simple absorbed power law may not be a good description 
of the X-ray spectrum. On the other hand, our limited counts statistics does not allow us to 

see _http://cxc.harvard.edu/ccw/proceedings/03-proc/presentations/marshalI2| 
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investigate for complex spectral shapes as often observed in AGN. However, we identify two 
possible additional spectral models. A first spectral model we investigate is the presence of a 
soft component in addition to the absorbed power law, as often found in the X-ray spectra of 
Seyfert 2 galaxies (e.g. Turner et al. 1997). Such a soft component can arise from several physical 
processes, like nuclear radiation scattered by a warm medium (the so-called "warm mirror", 
e.g. Matt et al. 1996), or nuclear radiation leaking through the absorber. In this cases, the soft 
component is expected to have the same spectral slope of the main power law. Here we do not 
consider the soft excess possibly due to thermal emission or comptonization of soft photons, as 
found in bright quasars (see Porquet et al. 2004). Thus, we repeated the fits simply adding to the 
Compton-thin model an unabsorbed power law component with slope equal to that of the main 
power law, requiring the intrinsic normaUzation of the soft component to be always less than 10% 
of the intrinsic normalization of the main power law. This last requirement embraces typical 
values both for a scattered component and for leaky absorbers (see Turner et al. 1997). This 
upper limit may exclude some leaky absorber with a low covering fraction, but at the same time 
helps us in avoiding false detections of high-normaUzation soft components implying spuriously 
high values of Nh relative to the absorbed component. With this procedure, a soft component is 
detected with AC > 2.7 in 8 sources. 

Moreover, when the intrinsic absorption is as high as Nh - 1.5 X 10^^ cm"-^, the Compton 
optical depth is equal to unity and the directiy transmitted nuclear emission is strongly sup- 
pressed in the Chandra soft and hard bands. In particular, for an intrinsic power-law spectrum 
with r = 1.8, the fraction of transmitted photons is less than 2% in the soft band up to redshift 
z = 2. Absorption is less severe in the hard band, where for z > 1 already a fraction of 10% of the 
emitted photons are recovered. It is clear that only the intrinsically brightest, heavily absorbed 
high-redshift AGN can be detected by their transmitted nuclear emission. In this regime, a radia- 
tion component reflected by a cold medium, expected to be in average 6% of the intrinsic power 
in the 2-10 keV band, starts to be important. For these Compton-thick sources, the most com- 
monly observed spectrum is dominated by a Compton-reflection continuum from cold medium, 
usuaUy assumed to be produced by the far irmer side of the putative obscuring torus. This can be 
modeled with the XSPEC model pexrav (Magdziarz & Zdziarski 1995) plus the redshifted Fe 
Khne. 

The pexrav model often provides a better fit for the sources in our sample with a flat spec- 
trum. For simplicity, we fix all the parameters to the default, typical values (F = 1.8, reflection 
relative normahzation=0, element and Fe abundance set to 1, cosine of incUnation angle set to 
0.45) but the normalization of the intrinsic power law spectrum. Our selection of Compton-thick 
candidates, then, is based on the comparison of the Cash-statistics obtained in the best fits with 
the zwabs pow model (with two free parameters, Nh and normaUzation) with that obtained with 
the pure reflection model (with only one free parameter, the normalization). The difference AC 
is an indication of the goodness of the pexrav model with respect to the standard absorbed 
power law. Due to the different number of free parameters and the low signal-to-noise typical 
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of our sources, we choose a threshold AC to select Compton-thick candidates after extensive 
simulations. The simulations procedure is described in Appendix B. We find that a threshold 
AC = 2 allows us to select a sample of Compton-thick candidates with a contamination fraction 
of about 20%. On the other hand, we also find that with our selection criteria, we may miss a 
fraction as high as 40% of the total Compton-thick population. Indeed, we find that, given the 
typical signal-to-noise of our sample, it is extremely difficult to efficiently select Compton-thick 
sources on the basis of the shape of the X-ray spectrum. We recognize that, in order to perform a 
careful search for Compton-thick candidates, other spectral features, Uke the Fe K line, or other 
wavelengths (like the submillimeter range of SCUBA) should be explored (see Alexander et al. 
2005b). This goes beyond the goal of this Paper 

To summarize, we label as C-thin the sources for which the best fit model is a power law 
with intrinsic absorption; C-thick the sources for which the best fit is given by a pexrav model; 
finally Soft-C for sources whose best fit model includes a soft component with the same slope 
of the main power law. Finally, we always add a gaussian component to model the Fe K line, 
which, in case of no detection, gives a null or negligible contribution to the spectral shape. 

4. Spectral slope for the bright sample 

First, we consider only the X-ray bright sample of 82 sources with more than 120 net detected 
counts in the soft band or more than 80 in the hard band, and more than 170 net counts overall. 
Among them, only two sources with soft component are found, and no Compton thick candidates. 
We note that the low fraction of sources with significant soft component, lower than that in the 
local sample of Turner et al. (1997), may be ascribed to the high redshifts in our sample, for 
which the soft component is often shifted below 0.6 keV. We use this subsample (1/4 of the 
total sample) to investigate the behaviour of the spectral slope F. The normalized distribution 
of spectral slopes for the X-ray bright sample is shown in Figure |5 The distribution has been 
obtained by extracting the value of F of each source lO** times from the range allowed by the 
statistical error bars, assuming a gaussian error distribution. With this procedure, we weight each 
source in the histogram according to the statistical errors on F. Before computing the weighted 
mean value, we exclude the two brightest sources in the sample (about 10"* net counts each) 
which otherwise would dominate the statistics. We find that the weighted mean value for the 
spectral slope of the bright sample is (F) = 1.75 + 0.02 (error bar refers to 1 cr uncertainty on 
the mean value). While the typical error on a single measure is about AF ^ 0.13, the dispersion 
of the distribution of the best fit values is cr ^ 0.33. Assuming that both statistical errors and 
the intrinsic dispersion in F are distributed as a Gaussian, the intrinsic scatter is of the order 
of cr,v„ ~ 0.30. If we focus on the 30 brightest sources to decrease the statistical errors (still 
excluding the two sources with ~ 10"* counts), the estimate of the intrinsic scatter decrease to 
(Tini ~ 0.20, and the weighted mean value is (F) = 1.81 + 0.01. 
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In Figure|5] we plot the best fit values of F versus the best fit values of the intrinsic absorption 
Nh- We do not detect any correlation between F and Nh (Spearman Rank coefficient SR ~ 
-0.04). Note that if the intrinsic absorption is close to the Galactic value for the CDFS field 
(NHgai - 8 X lO''^ cm"^) we are not able to derive any meaningful value, due to the low-energy 
limit of our spectral range (E > 0.6 keV). We considered these sources to be unabsorbed, plotting 
them at Nh = 10^" cm"- in our Figures. We detect no correlation between F and the hard rest- 
frame intrinsic (unabsorbed) luminosity (see Figure |6j. The Spearman Rank correlation is null 
also between F and the redshift (see Figure^}. 

From the analysis of the bright sample, we conclude that among our sources the intrinsic 
continuum is well approximated by a power law with F ^ 1 .8 (typical of Seyfert galaxies and 
AGN, as known also from ASCA studies of AGN, see Turner et al. 1997) at any epoch. On the 
other hand, it is well known that the flattening of the average spectrum of the sources at low 
fluxes in deep X-ray survey is due mainly to increasing intrinsic absorption (see Ueda et al. 
1999b; Tozzi et al. 2001; Piconcelli et al. 2003; La Franca et al. 2005). In addition, previous 
studies found no hints for a change in the slope of the intrinsic power law as a function of 
epoch or luminosity (see also Mainieri et al. 2002; Piconcelli et al. 2003; Vignali et al. 2003). 
We conclude that the slope of the intrinsic power law can be assumed to be constant for all the 
AGN population, and, therefore, we choose to fix the spectral slope to F = 1.8 when fitting the 
remaining fainter sources, focusing on the A^^ distribution for the whole sample. 

5. Results for the complete sample 

We complete the analysis of the total sample fixing F = 1.8 and deriving Nh for the remaining 
faint sources (239/321). We remark that our division in a bright and a faint subsample does 
not correspond to a dramatic selection in redshift. Indeed, the X-ray bright and the X-ray faint 
subsamples have a similar distribution in redshift (see Figure The results of the fits, along 
with the redshifts and the quality of the optical spectra, are shown in Table[2 

The distribution of the absorbing column densities is shown for the whole sample in Figure|9] 
Our results are in good agreement with preliminary results from the CDFN (Bauer et al. 2004a). 
The distribution has been obtained by extracting the value of Nh of each source 10^ times from 
the range allowed by the statistical error bars, assuming gaussian errors. When the lower cr error 
bars hit zero, we adopt the upper error bar to allow the Nh resampled value to go below zero; in 
this case, the resampled values are included in the lowest bin. The lowest bin shown is the value of 
the Galactic absorption, Nh - lO"" cm"^, below which we cannot measure the intrinsic absorp- 
tion, especially at high redshifts. This bin includes all the sources with nominal Nh best fit value 
lower than 10^" cm"^. Among these sources we expect both redshifted AGN with low absorbing 
columns and normal X-ray galaxies. Note that here Nh is an equivalent hydrogen column mea- 
sured assuming the photo-electric cross-sections by Morrison & McCammon (1983), with metal 
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abundances relative to Hydrogen by Anders & Ebihara (1982). The last bin at Nh - lO^'* cm"^ 
includes the few sources with measured Nh > 10^"* cm"^ and the Compton-thick candidates. 

We look for the Fe line only in those sources having at least 10 net counts in both bands, to 
have an acceptable estimate of the continuum and avoid spurious measures of high equivalent 
widths. Adopting a threshold AC > 2.7 with respect to the fit without the line, corresponding to 
a minimum 90% c.l. for one interesting parameter, we find evidence for a significant Fe line in 
20 sources with at least 10 net counts in both bands. The corresponding equivalent widths span 
the 100-3000 eV range. We carefully checked that our criterion AC > 2.7 actually corresponds 
to more than 90% c.l. also in the case of a line detection (for which the canonical confidence 
level criterion cannot be applied, see Protassov et al. 2002). For each X-ray source we simulated 
500 spectra starting from the observed best fit model without the line. We then fitted each sim- 
ulated spectrum and looked for any variation in the C-stat when adding a Fe line at 6.4/(1 + z). 
The frequency of occurrence of AC,,,,, > ACobs gives the probability P that the detected line is a 
statistical fluctuation. In FigureFTOIwe show the significance (l-P) of the Fe line versus the mea- 
sured AC. We conclude that in the large majority of the cases the criterion AC > 2.7 corresponds 
to a confidence level greater than 95%. Among the sources with more than 10 counts in both 
bands and a significant Fe line, 14/1 16 (~ 12%) are found among the sources with spectroscopic 
redshift, and only 6/125 (~ 5%) are found in the subsample with photometric redshift. This 
shows that, given our X-ray spectral resolution, the uncertainties in the photometric redshifts are 
likely to negatively affect the detection of the Fe line with our method, i.e., fixing the expected 
observing-frame energy of the line. Indeed, we notice that some sources do show strong hints of 
a Fe line at a redshift different from the photometric one (see Mainieri et al. 2005a), or peculiar 
lines (see Wang et al. 2003); finally, source variability could hide the emission line (see Braito 
et al. 2005). Therefore, we conclude that the fraction of sources with significant emission line 
is slightly larger than that found in an X-ray bright subsample in the CDFN (7%, see Bauer et 
al. 2004b). In principle, if the Fe line were produced only by the interaction of photons with the 
absorbing medium, a positive correlation between Nh and equivalent width might be expected 
in obscured sources (Leahy & Creighton 1993; Ghisellini, Haardt & Matt 1994). As shown in 
Figtn we do not find strong evidence of a correlation given the scatter of our data points, as 
already observed (see Mushotzky, Done & Pounds 1993). The Fe lines measured with low in- 
trinsic absorption (Nh < 10^^ cm"^), may be produced by the accretion disk, therefore breaking 
the expected correlation. 

In figure [21 we show the scatter plot of intrinsic absorption as a function of redshift for the 
whole sample. We note the lack of sources with high absorption [Nh > 10~^ cm"^) at z < 1. This 
is due to the fact that the low-luminosity, low-z sources with high absorption show a strongly 
suppressed flux, and only the intrinsically more luminous, rarer sources can be detected for a 
given threshold in count rate; the detection probability, then, decreases due to the small volume 
probed at low-z. We also note a lack of sources with low absorption (around Nh ~ 10^' cm"^) 
at high z. This effect may be due to the difficulty in measuring Nh at z > 2, since the absorption 
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cutoff is redshifted below the lower limit of the Chandra energy band we use (0.6 keV). This 
effect could result in spuriously high values of Nh with large error bars. Note, however, that some 
of the points are just 1 cr upper limits, implying the presence of sources with low Nh value at 
high redshift as well. It is clear that the Nh-z scatter plot shows the effects of the incompleteness 
and partial sampling of the AGN population. Before investigating the shape and evolution of the 
intrinsic Nh distribution, we must correct for the number of sources with a given Lx, Nh and z 
that fall outside our detection criteria. We will do this in the next Section. 

In Figure^lwe show the scatter plot of Nh versus the intrinsic, unabsorbed luminosities in 
the soft and in the hard band. We remark that the intrinsic luminosities are computed in the rest- 
frame soft and hard bands setting to zero the intrinsic absorption in the XSPEC best fit model; for 
the Compton-thick candidates we measure the intrinsic luminosities using a power law model 
with r = 1.8 and normalization fixed to that of the best fit pexrav model. With this assumption 
the emitted (reflected) luminosity of the C-thick sources is always about 6% of the intrinsic one 
in the hard band (while only 0.2% in the soft). We also note that this model may give a lower 
limit to the intrinsic luminosity, since its assumes a maximally efficient reflection; the intrinsic 
luminosity can be higher for lower refelection efficiency (Ghisellini, Haardt & Matt 1994). The 
envelope at low luminosity and high Nh is due to the fact that our survey is flux limited. The 
luminosity lower limit at a given redshift is not sharp, for two reasons: first, our survey is count- 
rate limited, and different spectral shapes may correspond to different fluxes and luminosities for 
the same count rates; second, the unabsorbed luminosities are related to the observed fluxes by 
a correction that depends on the measured Nh- For a preliminary investigation of a correlation 
between Nh and intrinsic luminosity, we select two regions in FigEl i) Lx > 10*^ erg s"' and 
Nh < lO^'* cm"^; ii) Lx > 10"*^ erg s"' and Nh < 10^^ cm"^. In this way we try to minimize the 
effects due to the flux-limited nature of our sample. In the first case, we do not find significant 
correlation between Nh and hard luminosity (Spearmann Rank coefficient SR = 0.06 for 154 
sources). In the second case as well, we do not detect significant correlation between Nh and 
hard luminosity (Spearmann Rank coefficient SR = 0.08 from 184 sources). This result is not 
in disagreement with results obtained from larger samples. Indeed, in flux-limited samples, the 
dependence of the absorbed fraction on luminosity tends to be much weaker, as discussed by 
Perola et al. (2004). In the following, we will not introduce by hand the correlation between the 
absorbed fraction and luminosity found in larger sample spanning more than six decades in flux. 
The inability of retrieving in our sample such a correlation, will not affect our main results, like 
the intrinsic distribution of Nh, with the caveat that we are probing the luminosity range up to 
few xlO'*'* erg s"^ 

In Figure^lwe also show the locus of Typell QSO, which is the upper right corner marked 
with the dashed lines. The criterion is Lx > IQ'^* erg s"' and Nh > 10^^ cm"^. For a spectral slope 
of r = 1.8, a total luminosity of 10"*"* erg s"' in the 0.5-10 keV band corresponds to 3.9 x 10"*^ 
erg s ' in the 0.5-2 keV band and 6.1 x 10"*^^ erg s ' in the 2-10 keV band. With these criteria, 
using X-ray spectral parameters and, most importantly, unabsorbed luminosities, the number of 



14 P. Tozzi et al.: X-ray spectral properties of AGN in the CDFS 

QSOII in the CDFS sample is 54. This corresponds to a surface density of X-ray selected QSO 
equal to (620 ± 80) sq deg"^ at the flux limit of 5 x 10"'^ erg cm"^ s"' . This is higher than the 
value found by Padovani et al. (2004), but the difference is due to their selection based on the 
condition L2-10 > ^0*'^ erg s"'. Applying the same criteria, we find a surface density of (360 ±50) 
sq deg"^ in very good agreement with Padovani et al. (2004; see also La Franca et al. 2005). We 
note, however, that the density of Typell QSO depends sensitively on the luminosity cut in the 
intrinsic power used in the analysis. 

Finally, we present a sample of 14 Compton-thick candidates selected only on the basis of 
the X-ray spectral shape with the selection thresholds described in §3.2. Two of them were al- 
ready identified as Compton-thick sources on the basis of multiwavelength data (source ID 202 
and 263, see Norman et al. 2002; Mainieri et al. 2005b). We assign a value Nh > 1.5 x 10^"* cm"^ 
to our Compton-thick candidates. Among them, 2 sources (out of 7 with secure spectroscopic 
redshift) show a Fe K emission line, while no Compton thick candidate source with photometric 
redshift does show a statistically significant line. We believe that the uncertainties in the photo- 
metric redshift prevent us from recovering the line. We also note that some high column density 
sources at low redshift may not have strong Fe K lines (see Fruscione et al. 2005). We checked 
that the distribution of the net detected counts of the C-thick candidates is not diff'erent from that 
of the whole sample, indicating that there are no evident bias due to the low signal-to-noise. The 
net-detected counts for the C-thick sample ranges from 170 to 40, with an average of 65. We no- 
tice that for these sources the detection probability is low, due to their hard spectra. Consequently, 
their associated sky-coverage is low, and their surface density correspondingly higher, close to 
(200 + 50) deg"^. The actual surface density of C-thick sources may be 20% higher if including 
selection effects (see Appendix C). We notice also that the fraction of C-thick sources predicted 
by updated models for the synthesis of the XRB is in very good agreement with that found in the 
CDFS (Gilli, Comastri &. Hasinger 2006, in preparation). 

6. Intrinsic absorption distribution and its evolution with cosmic epoch 

In this Section, we estimate the intrinsic absorption distribution (the Nh function) for the AGN 
population in our sample. The distribution of Nh that we showed in Figure O does not include 
any correction for incompleteness, and it refers only to the sources observed in the region of 
the Nh-Lx-z space which is delimited by the count-rate detection thresholds of the survey. To 
go from this distribution to a distribution which is representative of the whole AGN population, 
we must apply two independent corrections. The first is the completeness correction and it is 
given by the eff'ective solid angle under which a source of a given intrinsic luminosity, absorbing 
column density and redshift, is detected in the CDFS with our criteria. The second correction 
takes into account the sources which are outside the detectability region in the Nh-Lx-z space, 
and therefore it must be based on a specific model of the luminosity function of AGN. We remind 
that a reliable luminosity function cannot be obtained from CDFS data alone, but should rather be 
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derived from a combination of wider surveys, in order to sample the bright end of the luminosity 
distribution, which is poorly represented in our pencil beam survey (see Brandt & Hasinger 
2005). We describe these two corrections below. 

To correct for incompleteness, we simply weight each source for the inverse of the soUd 
angle under which the source can be detected in the CDFS. To measure this quantity, first we 
compute the net count rate in the soft and hard band that would be measured in the aimpoint of 
the CDFS for each source in the sample, using its best-fit model. Then, we measure the soUd 
angle a>, where the ith source can be detected in the CDFS, including the vignetting correction 
and the background evaluated locally. Since the detection threshold is appUed separately in the 
hard and the soft image, the effective soUd angle is the largest between the two. We recall that our 
survey is limited in count rate, not in flux, and for a given intrinsic luminosity and redshift, the 
count rate is strongly dependent on the intrinsic absorption, especiaUy in the soft band, where the 
sensitivity of our survey is the highest. Most of the sources have the largest detectabiUty angle 
in the soft band, while the fewer, strongly absorbed, hard sources have the largest detectabiUty 
solid angle in the hard image. The a priori probability of having a given source included in the 
CDFS sample is simply the ratio of the soUd angle to the total solid angle covered by the 11 
exposures of the CDFS {ojcdfs = 0.108 deg^). Then, when birming our sample as a function of 
the measured A^^, we weight each source for the inverse of its detection probabiUty: 



Here, the weight P, would be equal to 1 if Nh were measured with negligible error with 
respect to the size dNu of the bin. To account for statistical uncertainties in the measured value 
of Nh for each source, we put P, equal to the probability that the actual value falls within the Nh 
bin, according to the best fit value and its error bars. The error on F{Nh) is the poissonian error 
associated to the number of sources counted in the bin Nh-Nh + dNn- 

Then, we compute the second correction, to account for the sources which are outside the 
detectabiUty region in the Nh-Lx-z space in the CDFS survey. This correction is relevant for 
strongly absorbed sources, since our Umit in count-rate aUows us to sample a smaller range of 
intrinsic luminosity for increasing Nh at a given redshift. This effect is mitigated at high redshift 
due to the positive X-ray K-correction. Therefore, for any given redshift and luminosity, we 
are measuring a difi'erent fraction of unabsorbed and absorbed sources with respect to the total 
AGN population. As a consequence, the directly observed fraction of sources with a given Nh is 
aff'ected by the shape of the actual AGN luminosity function and by its cosmic evolution. 

To correct for this efl'ect, we must assume a model for the AGN luminosity function. One 
of the most recent is the Luminosity Dependent Density Evolution model obtained by Ueda et 
al. (2003; but see Barger et al. 2005 for another determination of the AGN X-ray luminosity 
function consistent with pure luminosity evolution), in which low-luminosity sources peak at 
lower redshift than high-luminosity AGN. Such a luminosity function is measured from a com- 
bination of surveys with HEAO-1, ASCA and Chandra including part of the CDFN sample (see 
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also Hasinger, Miyaji and Schmidt 2005 for the most recent measure of the Type I AGN lumi- 
nosity function). In particular, we use equations 11-15-16-17 of Ueda et al. (2003) to write the 
comoving density of AGN per hard-band luminosity interval N(Lx, z). 

After assuming a luminosity function for the whole AGN population, we can write the num- 
ber of AGN in a given interval of Nh, Lx and z as 



where V is the comoving volume element, and f{NH,L,z) is the probability of measuring an 
intrinsic absorption between Nh and Nh + iJNh for a given Lx and z- Let's assume that /{Nh, L, z) 
is slowly varying as a function of Lx and z in our sample. The total number of sources that we 
are detecting in our survey with intrinsic absorption between Nh and Nh + dNn is then given by: 



Here the luminosity Lcut{NH,z) is the 2-10 keV intrinsic luminosity for which, at any given z 
and Nh, the net count rate is equal to the minimal count rates in the hard or in the soft band. The 
minimal count rates for detection in the aimpoint of the CDFS are 1 .2 x 10"^ cts/s in the soft and 
1.5 X 10"^ cts/s in the hard band. These values are defined with small uncertainties because of the 
rapid drop of the sky coverage as a function of the count rate in both bands. To compute Lcm, we 
assume that in average our sources can be described with a Compton-thin model with spectral 
slope fixed to F = 1.8, plus a reflection component with the same slope and normahzation. The 
reflection component (modeled with the pexrav XSPEC model) amounts to 6% of the hard 
intrinsic luminosity. Such a reflection component will dominate the emission of the Compton- 
thick sources with Nh > 1 .5 x lO^'* cm"^. The value of Lcut as a function of Nh is shown in Figure 
ll4lfor different redshifts. We note that for unabsorbed sources (Nh < 10^^ cm"^) the cut depends 
only on the intrinsic luminosity at any redshift. However, for larger column densities, the cut in 
luminosity is higher for larger Nh, but the effect is weaker at higher z where the positive X-ray 
K-correction shifts the hard rest-frame emission in the soft band. In the Compton-thick regime, 
a roughly constant fraction of the intrinsic luminosity reflection by cold material dominates the 
emission, making Lcut flat again. We do not attempt to include the effect of the presence of the 
scattered component, which is detected only in less than 3% of the sources in our sample. 

Since F{Nh) (computed with equation 1) is the directly observed Nh distribution (after cor- 
recting for incompleteness), the probability function /(Nh) can be obtained after equation 3 
(discretizing the integral over Nh)- The resulting fraction of AGN visible in the CDFS as a func- 
tion of Nh is shown in Figure^] for three different redshift intervals (solid hnes), and for the 
whole explored redshift range (thick dashed line). This fraction is computed as the ratio of the 
detectable AGN over the total number of AGN predicted by the Ueda et al. luminosity function 
in the range L„a.x - 10"*^ ergs s"', L,„,„ = lO'*' ergs s"', and Zma.x - 5. Note that the low val- 
ues of this fraction does not imply that the majority of the AGN are not detected in the CDFS; 
in fact, such low values are mostly due to the conservatively low minimum luminosity adopted 
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here (L,„,„ - 10*^ ergs s ') and depend on the faint end slope of the luminosity function. These 
aspects, in turn, weakly affects the dependence of the fraction on Nh, which is our main concern 
here. Here we do not discuss the effects of the shape of the underlying luminosity function, post- 
poning this to a subsequent paper Therefore, we estimate in a robust way the dependence of the 
total fraction of visible AGN on the redshift (given the flux limit in the CDFS) and on Nh- The 
fraction decreases towards higher values of Nh due to the reduced emission in the soft band, but 
it flattens again in the Compton-thick regime, where the emitted luminosity is roughly a constant 
fraction of the intrinsic one. 

The corrected, normalized distribution of the intrinsic absorption for the whole sample is 
shown in Figure Errors are obtained from the poissonian uncertainties on the number of 
detected sources in each bin. The distribution that we measured is bimodal, in the sense that 10% 
of sources have Nh < 10^" cm"^ and appear separated from the distribution of the bulk of the 
sources. However, we remark that the fraction of sources with negligible absorption in our sample 
may include normal galaxies with high star formation rate. The distribution of the bulk of the 
sources can be roughly approximated with a lognormal distribution centered on {logiNn)) - 23. 1 
and with a dispersion cr - 1.1. We remark that in the Compton-thin regime, where our estimates 
are more robust, the number of obscured sources is steeply increasing with Nh in agreement with 
Risaliti et al. (1999) and Dwelly et al. (2005). 

This distribution accounts for the Compton-thin sources with intrinsic absorption up to Nh - 
10^"* cm"^, and for Compton-thick sources at higher absorption, bridging the bulk of the AGN 
to the Compton-thick population. This is the main difference with the distribution presented in 
Treister et al. (2004), where the fraction of sources with Nh > 10^^ cm"^ is dropping. Indeed, 
strongly absorbed AGN are expected to be missed by surveys that rely on optical spectroscopy. 
Here we show that part of the population of Compton-thick sources can be detected in present 
deep X-ray Surveys via a careful spectral analysis of all the X-ray detected sources, avoiding 
selection based on optical spectroscopy. Our results are consistent with the preliminary results 
on the Nh distribution found in the CDFN (Bauer et al. 2004a), which already shows a peak at 
larger Nh values with respect to the results of Ueda et al. (2003). We remark that this result is 
not affected by small variations with respect to the luminosity function proposed by Ueda et al. 
(2003), which indeed is consistent with the present data on the AGN luminosity distribution. To 
summarize, we conclude that at least part of the expected population of strongly absorbed AGN 
(expected to be observed in the submillimiter with the Spitzer satellite) is akeady present in the 
deep X-ray Survey such as the CDFS. 

The Nh function is derived under the assumption of no strong intrinsic correlation between 
L and Nh or z and Nh in our sample, so that we can obtain /{Nh) without binning our sample 
as a function of Lx or z- However, here we investigate for possible evolution with redshift of 
the absorbed fraction of sources. Due to the limited statistics, we focus on the cosmic evolution 
of the ratio of absorbed sources (Nh > 10^^ cm"^) to all the AGNs in three bins of redshift. 
The redshift bins are z = - 0.7, 0.7 - 1.5, 1.5 - 5, including 76, 125 and 109 sources with 
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Lx > 10"^' erg s"' respectively (the first two bins include the two most prominent spikes in the 
CDFS redshift distribution at z = 0.67 and z = 0.73, as shown in Gilli et a. 2003). The correction 
for the absorbed sources that are missed is larger at low redshift, as can be seen in Figure [Tsl 
(upper curve for the redshift range z = 0-0.7), while at high z is almost flat up to log(A^/f)= 23.5 
(lower curve for the redshift range z - 1.5-5). In Figure [TTl we show that the absorbed fraction 
is consistent with a moderate increase, in agreement with the model of Gilli et al. (2001; see also 
Civano, Comastri & Brusa 2005). We remark that the absorbed fraction in the first bin at z < 0.8, 
including the low luminosity sources, may be underestimated due to the presence of star forming 
galaxies in the luminosity range 10"^'-10^^ erg s 

We note that the overall value of the fraction of absorbed sources is larger than that found by 
Ueda et al. (2003). However, the points of Ueda et al. (2003) include only sources with Lx > 10'*^ 
erg s and therefore are expected to be significantly higher when including lower luminosities. 
The global fraction of absorbed sources is in agreement with that estimated in the CDFN by 
Perola et al. (2004), and with a ratio of absorbed over unabsorbed sources in the sample of about 
4, as observed in the local Universe (eg. Maiolino & Rieke 1995). This value is also consistent 
with the theoretical expectation of 3/4 of all the AGN being absorbed as in the standard unifica- 
tion scenario (Antonucci 1993). While in the CDFS and CDFN the fraction of obscured sources 
seems to be in agreement with the expectations of the standard unification scenario and popular 
synthesis models of the X-ray background, in shallower serendipitous surveys like those per- 
formed with XMM by Piconcelli et al. (2003) and Mateos et al. (2005) obscured sources seem 
to be a factor of ^ 2 less abundant. At typical X-ray fluxes of a few 10"'"* erg s"' cm"^, XMM 
serendipitous sources have a median luminosity of a few lO'*'* erg s ' . It is therefore possible that 
the intrinsic fraction of obscured sources is decreasing at luminosities higher than that observed 
in the Chandra Msec fields, which would point towards a paucity of obscured QSOs as found 
by Ueda et al. (2003; see also La Franca et al. 2005). Alternatively, one could argue about the 
large spectroscopic incompleteness of XMM samples (more than 60% of the sources are as yet 
unidentified) before drawing solid conclusions. 

7. Comparison between X-ray and Optical properties 

If we classify the whole sample of 321 sources with Lx > 10"^' erg s"', according only to the 
optical spectra, we obtain the following: 

- 34 Broad Line AGN (BLAGN); 

- 20 High Excitation Line galaxies (HEX); 

- 67 Low Excitation Line Galaxies (LEX); 

- 22 Absorption spectrum typical of late-type galaxies; 

- 178 non classified. 

In this section we compare the optical classification with the X-ray classification, to in- 
vestigate if a revision of the unification model is actually needed (see, e.g.. Matt 2002). This 
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was already done in Szokoly et al (2004); the main difference here is that we use unabsorbed 
luminosities and intrinsic absorption as opposed to absorbed luminosities and hardness ratio, 
providing therefore a more physical X-ray classification. We use the value Nh - 10^^ cm"^ as 
the threshold to divide X-ray unabsorbed sources from X-ray absorbed ones. We define nor- 
mal X-ray galaxies the sources with Nh < 10^^ cm"^ and Lx < 10"*^ erg s"'. Our results are 
shown in Table|3 to be compared with Table 8 of Szokoly et al. (2004). We remark that the class 
"normal galaxies", amounting to 42 sources, may include low luminosity AGN. Indeed, if we 
restrict our criterion to source with low intrinsic absorption (values Nh < 10^' cm"^ can be due 
also to diffuse matter in the host galaxy, as opposed to the larger absorbing columns typical of 
circumnuclear matter), the normal galaxies class would include 23 sources only. Therefore, we 
can bracket the contamination of our sample by normal galaxies to be between 7% and 14% of 
the total sample. 

We also plot the normalized distribution of the intrinsic absorption and hard luminosities 
for the four optical classes in Figures ^] and ^] Here we account for the statistical errors by 
resampling each value according to its error bars, but we do not introduce any correction for 
selection effects, since here we are dominated by optical selection criteria. We find that, as ex- 
pected, the BLAGN class mostly includes AGN with low absorbing column densities; among the 
34 BLAGN, only 7 sources have A^^ > 10^^ cm"^; they give a fraction of 0. 18 of BLAGN with 
Nh > 10^^ cm"^, after accounting for statistical errors. This fraction is somewhat larger than that 
found in shallower surveys by Perola et al. (2004) and in the ChaMP survey by Silverman et al. 
(2005). However, we notice that most of the absorbed BLAGN are at z > 2. Due to the large 
errors expected when measuring Nh in high redshift sources, we do expect a scatter towards high 
values increasing with redshift. A spurious trend Nh <^ (1 + z)^ may be visible if we simply plot 
the best fit values for Nh- We carefully checked with simulations with XSPEC that the error bars 
keep track of this effect, being larger at higher z. In these simulations, described in Appendix C, 
we show that in the hypothesis of Nh - for all the BLAGN sources, we should expect none 
of them to have Nh > at 2 cr c.l. Instead, we find five of them to have Nh > 10^^ cm"^ at 
2 cr. Using the better count statistics and the larger energy range (E > 0.2 keV) of XMM (see 
Streblyanska et al. 2004), the spectral analysis of 5 of these sources gives absorption in the range 
10^' < Nh < 10^^ cm"^, confirming that these BLAGN have a non-negligible absorption, but 
that the Chandra best-fit values are somewhat higher, possibly due to the limited energy range 
used which may hamper the measure of low column densities at high z. To summarize, we put a 
strict upper limit of 18% for absorbed sources {Nh > 10^^ cm"^) within BLAGN. 

Absorbed AGNs with A^^ > 10^^ cm"^ are found mostly in the HEX and LEX classes (80% 
and 60% respectively). They are also found in the ABS class, where, however few sources have 
Nh > 10^^ cm"^. We find less evidence for Narrow Line AGN (here classified as HEX) with low 
absorption. We observe only about ~ 10% of such sources, for which the most likely scenario 
is severe dilution of the AGN optical emission by the underlying galaxy. Therefore, the simple 
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identification scheme of unabsorbed AGN with optical Type I (BLAGN) and absorbed AGN with 
optical Type II (HEX and LEX) is roughly coiTect, with uncertainties of less than 20%. 

As for the hard luminosities (Figures[T9t. we show that the BLAGN and HEX classes have X- 
ray luminosities in the range 10'*^-10"*'* erg s ' typical of AGN, with very few sources below 10"*^ 
erg s '. The value 10"*^ erg s ' can be considered as an effective luminosity threshold dividing 
AGN and normal or star forming galaxies, except for few cases of galaxies with a strong starburst, 
which can reach Lx ~ 10"*^ ergs s"' for a star formation rate of about 100 Mo/yr (Ranalli et al. 
2003). This luminosity range, where the presence of normal galaxies is expected to be significant, 
starts to be progressively populated in the LEX and ABS classes. However, also for the HEX 
class the majority of the sources have luminosities Lx > 10^^ erg s"', and only the ABS class is 
consistent with being an equal mix of galaxies and AGN. The distribution of the intrinsic rest- 
frame luminosities in the hard bands shows that broad line AGN have larger intrinsic luminosities 
than narrow line AGN, as noted by Barger et al. (2005). In particular, the fraction of BLAGN in 
our sample among the sources with optical spectra, is strongly increasing with luminosity, while 
their average luminosity is increasing with redshift, in agreement with the findings of Steff'en et 
al. (2003), as shown in FigureEOl However, due to our small sampling volume at low redshift, to 
the low optical spectral completeness of our sample (^ 1/3), and, finally, to the possible effect of 
the stellar dilution that may hinder the presence of broad lines (see, e.g., Moran et al. 2002) we 
do not draw strong conclusion on this aspect. 

We note also that, given the intrinsic luminosities and the intrinsic absorption values found 
in the remaining subsample of 178 sources without a clear optical classifications, about 90% of 
them are expected to be secure AGN. Overall, we find that at least 80% of the AGN with spectral 
ID in our sample agrees with simple AGN unification models (Antonucci 1993), confirming 
findings of wider and shallower surveys (see, e.g., Silverman et al. 2005). 

8. Conclusions 

We presented the detailed spectral analysis of 321 sources in the CDFS, taking advantage of 
spectroscopic and photometric redshifts. We fitted the source X-ray spectra assuming a default 
model consisting in a single power law with intrinsic redshifted absorption (plus a local ab- 
sorption frozen to the Galactic value in the direction of the CDFS) and a Gaussian line at the 
redshifted energy of the Fe K line complex. We look for sources with a spectrum dominated 
by a reflection component (Compton-thick candidates) and for sources showing an unabsorbed 
scattered component at soft energies. We are able to derive the spectral slope distribution for the 
82 brightest sources in the sample and intrinsic absorbing column density for the whole sample. 
Then, from the observed Nh distribution, we derive the intrinsic Nh distribution for the whole 
AGN population, after correcting for incompleteness and for the differential sampling of the 
AGN population as a function of intrinsic luminosity and Nh (modelling the luminosity function 
of AGN after Ueda et al. 2003). We accounted for statistical errors in our measures by convolving 
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the distributions according to the error bars associated to each measurement. We also look for 
evolution in the fraction of absorbed sources as a function of the redshift. Our main results are 
summarized as follows: 

- We investigate the spectral slope of the intrinsic spectrum for the 82 sources of the X-ray 
bright sample, excluding the two brightest that otherwise would dominate the statistics. We 
find that the average value for the slope of the power law is (F) ^ 1.75 ±0.02, with an intrinsic 
dispersion of the order of cr,„, ^ 0.30. 

- We find no correlation between the spectral index T and the intrinsic absorption column 
density Nh nor the intrinsic luminosity. We do not detect any evolution of the average F with 
redshift. 

- We select 14 Compton-thick candidates, for which we can only assess a lower limit to the 
intrinsic column density of Nh > 1.5 x 10^"* cm~^. Due to their low detectability, the surface 
density can be as high as (200 ± 50) deg"^. 

- We find significant evidence (at more than 90% confidence level) of a Fe line in 20 sources, 
most of them (14) for the sources with spectroscopic redshifts. We also find unabsorbed soft 
emission, fit with a power law model with the same slope as the main power law, possibly 
associated with a scattered component, in only 8 sources. 

- The intrinsic Nh distribution is well approximated by a lognormal distribution centered on 
{log(NH)) - 23.1 and with a dispersion cr = 1.1. This distribution differs from that found by 
Ueda et al. (2003), which shows a broader peak at lower values of Nh- Our distribution in- 
cludes the contribution of many more absorbed AGN, since we explored the faint X-ray flux 
range, where strongly absorbed sources dominate in number. This shows that the population 
of Compton-thick AGN (expected to be observed with the Spitzer satellite) is at least partly 
accounted for in deep X-ray surveys when all the X-ray selected sources are included. 

- We find hints that the fraction of absorbed sources is increasing with redshift, consistently 
with XRB synthesis models. 

- We find that the simple unification model, i.e. the one-to-one correspondence of unab- 
sorbed/absorbed X-ray sources to Typel AGN-QSO/Typell AGN-QSOII, is accurate for 
at least 80% of the sources with spectral identification (~ 1/3 of the total X-ray sample). 

We remark that once the ongoing or planned spectroscopic follow-up of the many Chandra 
and XMM surveys will be completed, the same kind of detailed spectral analysis will be per- 
formed on a much larger number of sources. This will allow one to fimnly understand the distri- 
bution of spectral properties among AGN, and to suggest improvements to the unification model 
in view of the complex relation between X-ray and optical types. 
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Table 1. Best fit parameters for the whole sample of sources in the CDFS with a measured spectroscopic 
or photometric redshift. Error bars correspond to 1 cr c.l. Luminosities are computed for a flat A = 0.7 
cosmology and Hq = 70 km/s/Mpc. ID are from Giacconi et al. (2002). Quality flags with Q > I indicate 
optical spectral quality: Q = I corresponds to spectra with a single optical line identified; Q = 2 indicates 
secure spectra but uncertain counterpart; 2 = 3 indicates secure redshift. Photometric redshifts are obtained 
from different methods described in Zheng et al. (2004): HyperZ ((? = 0.2), BPZ (Q = 0.3), COMBO-17 
(Q = 0.4 Wolf et al. 2001; 2003; 2004). When we have consistent redshift from more than one method, the 
corresponding quality flag is the sum of the single Q. 
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Table 2. Comparison of the optical and X-ray classification 





BLAGN 


HEX 


LEX 


ABS 


Undet 


X-ray AGN-1 and QSO-1 


27 


2 


13 


3 


40 


X-ray AGN-2 and QSO-2 


7 


18 


42 


10 


118 


X-ray galaxy 








12 


9 


21 



p. Tozzi et al.: X-ray spectral properties of AGN in the CDFS 



37 



T 1 — I I I I I I \ 1 — III 



~l — I I I I I 1 1 — I I I I I 



n 



-J] 



-n 



L I 



L I 



Ml 




I r ^ 



I 



I. I I 



^' n 

1 1 1 1 1 1 ill I 



10 100 1000 

Net Counts 



104 



Fig. 1. Distribution of the net detected counts for all the sources in the sample (solid line: 0.5-2 
keV band counts; dashed line: 2-7 keV band counts). 
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Fig. 2. The average best-fit parameters (with rms dispersion) for a source with intrinsic Nh - 
5 X 10^^ cm"^ and F - 1.7 at z = 1 fitted with Cash statistics (filled circles, continuous error 
bars) and (filled squares, dashed error bars) versus the number of net detected counts in the 
0.5-7 keV band. The;^'^ points are slightly shifted along the x-axis for clarity. The same source 
is simulated 1000 times for five different intrinsic normalizations, resulting in a different average 
number of net detected counts. 
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Fig. 3. Left: best fit values of the spectral slope F as a function of the net counts in the soft (top 
panel) and hard (bottom panel) bands (for sources with more than 40 net counts in the 0.5-7 keV 
band). Right: fractional statistical error (Icr c.l.) on F as a function of net counts in the soft (top 
panel) and hard (bottom panel) band (upper and lower errors are shown as triangles and upside- 
down triangles respectively). Vertical dashed Unes are the thresholds adopted to select the bright 
X-ray sample (82 sources with more than 120 net counts in the soft band or more than 80 in the 
hard band, or more than 170 counts overall). 
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Fig. 4. Distribution of the spectral slope T for the X-ray bright sample (82/321 sources). Error 
bars are 1 cr poissonian errors. The solid vertical line refers to the central value (F) - 1.75. 




Fig. 5. Scatter plot of the best fit values of T and Nh for the bright X-ray sample (82 sources). 
Error bars correspond to 1 cr. The dashed and dotted horizontal lines show the average value of 
r and its rms dispersion respectively. 



42 



P. Tozzi et al.: X-ray spectral properties of AGN in the CDFS 




41 42 43 44 45 

log(^ erg/s) (2-10 keV) 



Fig. 6. Scatter plot of the best fit values of T versus unabsorbed hard rest-frame luminosities 
for the bright X-ray sample (82 sources). Error bars correspond to 1 cr. The dashed and dotted 
horizontal lines show the average value of F and its rms dispersion respectively. 
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Fig. 7. Scatter plot of the best fit values of F versus redshift for the bright X-ray sample (82 
sources). Error bars correspond to 1 cr. The dashed and dotted horizontal lines show the average 
value of r and its rms dispersion respectively. 
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Fig. 8. Normalized redshift distribution for the X-ray bright (solid line, 82 sources) and the X- 
ray faint (dashed Une, 253 sources) subsamples. 
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Fig. 9. The solid line shows the observed Nh distribution for the whole sample (321 sources). 
Error bars refer to Icr poissonian uncertainty due to the limited number of sources in each bin. 
The dashed histogram shows the distribution obtained without resampling according to measure- 
ment statistical errors. No coiTection for incompleteness and volume-sampling effects has been 
appUed. 
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Fig, 10. Significance of the Fe line (measured with simulations, see text) plotted versus the ob- 
served AC. Pointed circles are sources included in the bright sample. The solid line shows the 
significance vs AC for one interesting parameter assuming the same statistics for AC ■mAx'-. The 
vertical line is the threshold corresponding to AC - 271 . 
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Fig. 11. Equivalent width of the Fe line plotted versus the intrinsic absorption Nh for the 20 
sources with Fe line significant at more than 90% c.l. Errors on the equivalent width are de- 
rived from the errors on the normahzation of the line component. Compton-thick candidates are 
plotted at Nh = 1.5 x lO^'* as lower limits to the actual value. 




Fig. 12. Intrinsic absorption versus redshift for the complete sample. Upper limits (1 cr) are used 
for measures consistent with Nh - within 1 cr. Compton-thick candidates are plotted at Nh - 
1.5 X 10^"* as lower limits to the actual value. Error bars correspond to 1 cr. 
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Fig. 13. Unabsorbed rest-frame luminosities in the soft (upper panel) and in the hard (lower 
panel) band plotted versus the intrinsic absorption. Upper limits (1 cr) are used for for measures 
consistent with Nh - within 1 cr. Error bars correspond to 1 cr. Upper right corners outlined 
by the dashed Unes show the locus of QSO-II, defined as sources with Lx > 10^"* erg s ' and 
Nh > 10^^ cm"^ (as opposed to the criterion HR > -0.2 and Lx > 10"''* erg s ' used in Szokoly 
et al. 2004). 
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Fig. 14. Intrinsic minimum rest-frame luminosity in the 2-10 keV band {La,,) of a source that 
can be detected at the CDFS aimpoint as a function of intrinsic absorption for redshifts z - 
0.12 - 0.25 - 1 - 2.15 - 3.5 - 4.5 from the bottom to the top. The assumed model is a Compton 
thin power law with Y - 1.8 plus a reflection component equal to 6% of the hard intrinsic 
luminosity. 
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Fig. 15. Total fraction of all the AGN detected in the CDFS survey as a function of the intrinsic 
Nh, according to the detection criteria in the CDFS and assuming the luminosity function of 
Ueda et al. (2003). The total fraction strongly depends on the minimum detectable luminosity 
and hence, given the flux limit in the CDFS, on the redshift range. Note that low fractions are 
implied by the conservatively low value L„„„ (10"*' ergs s"') which defines the total population of 
AGN. The thick, dashed line is the correction for the whole sample. The three continuous lines 
refer to three intervals in redshifts: z = - 0.7, z = 0.7 - 1.5, z = 1.5 - 5 from top to bottom. 
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Fig. 16. Intrinsic A^h distribution representative of the whole AGN population in our sample 
(corrected for incompleteness and sampling-volume effect, and convolved with the statistical 
errors of each measurement). Errors are obtained from the poissonian uncertainties on the number 
of detected sources in each bin. The dashed curve is a lognormal distribution with {logiNn)) - 
23.1 and cr - 1.1. Compton-thick candidates are all in the bin at Nh - 10^"*. 



p. Tozzi et al.: X-ray spectral properties of AGN in the CDFS 



53 



0.8 



0.6 



o 0.4 








2 
z 



Fig. 17. Fraction of absorbed AGNs with Nh > 10^^ cm"^ to all AGNs with Lx > lO'*' erg 
s ' (2-10 keV band) as a function of redshift (solid circles). Triangles are the data points from 
Ueda et al. (2003) for lO'*^ < Lx < 10*'^-^ erg s"' (2-10 keV band). Rest-frame luminosities are 
computed for a A = 0.7 flat cosmology and Hq = 70 km/s/Mpc. 
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Fig. 18. Normalized Nh distribution for the four different optical classes (see text). 
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Fig. 19. Normalized distribution of the intrinsic rest-frame luminosity in the hard band for the 
four different optical classes (see text). 
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Fig. 20. Upper panel: fraction of BLAGN among the sample of sources with optical redshift as 
a function of the 0.5-10 keV luminosity bin. Lower panel: average 0.5-10 keV luminosity of 
BLAGN as a function of the redshift bin. Only sources with Lx > 10*^ erg s ' are considered in 
the lower panel. 
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Appendix A: Fitted X-ray spectra 

The background-subtracted, unfolded spectra of the sources analyzed in this Paper, along with 
the best fit models, are shown in Figures lAmiA.22l Spectra are binned for display purpose only, 
with the simple criterion of having at least 20 counts or a signal-to-noise of 3 in each bin for 
sources with more than 100 net detected counts. Weaker sources spectra are binned with at least 
10 counts or a signal-to-noise of 2 in each bin. We recall that the binning is used only to plotting 
purpose, while the unbinned spectra are used when performing the fit, as described in the text. 
Each source is fitted with the best-fit model, which is the canonical Compton-thin plus gaussian 
line model for the large majority of the sources (Figures IA.lHA.22t . For 14 Compton-thick 
candidates we show the reflection model plus the gaussian line (Figures fA.20> . while in 8 cases 
we add a soft component (Figures rA.21> . We remind that a reversed edge at 2.07 keV is added to 
each spectrum to take into account a small increase in the efficiency of the ACIS detectors which 
is not yet included in the ACIS response functions in CALDB2.26. Such a feature, visible as a 
small step just above 2 keV, should not be considered intrinsic to the sources. 
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for this figure see"littp://adlibitum.oat.ts.astro.it/tozzi/spectral_CDFS/paper_lMspec.ps( 



Fig.A.l. Unfolded spectra and best fit model for the 299 C-thin sources (fitted with a zwabs 
pow model with T = 1 .8 plus a narrow Gaussian line at 6.4 keV rest frame). 

for tliis figure see'http://adlibitum.oat.ts.astro.it/tozzi/spectral_CDFS/paper _lMspec.ps' 

Fig. A.2. Figure lXn continued. 

for tliis figure see littp://adlibitum.oat.ts.astro.it/tozzi/spectral-CDFS/paper_lMspe c.ps| 

Fig. A.3. Figure lXn continued. 

for tliis figure see littp://adlibitum.oat.ts.astro.it/tozzi/spectral-CDFS/paper_l Mspec.ps| 

Fig. A.4. Figure lATI continued. 

for tliis figure see http://adlibitum.oat.ts.astro.it/tozzi/spectral_CDFS/paper_lMsp ec.psl 

Fig. A.5. Figure lXn continued. 



for this figure see http://adlibitum.oat.ts.astro.it/tozzi/spectral_CDFS/paper_lMspec, 




Fig. A.6. Figure lXn continued. 
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for this figure see"http://adlibitum.oat.ts.astro.it/tozzi/spectraLCDFS/paper_lMspec.ps( 



Fig. A.7. Figure lATI continued. 



for this figure see http://adHbitum.oat.ts.astro.it/tozzi/spectral_CDFS/paper_lMspec.ps' 



Fig. A.8. Figure IXn continued. 



for this figure see http://adlibitum.oat.ts.astro.it/tozzi/spectral_CDFS/paper_lMspe c.psl 



Fig. A.9. Figure lATI continued. 



for this figure see http://adlibitum.oat.ts.astro.it/tozzi/spectral_CDFS/paper_lMspec.psl 



Fig. A.IO. Figure lATI continued. 
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for this figure see"littp://adlibitum.oat.ts.astro.it/tozzi/spectral_CDFS/paper_lMspec.ps( 

Fig. A.ll. Figure lATI continued. 

for tliis figure see http://adlibitum.oat.ts.astro.it/tozzi/spectral-CDFS/paper_lMspe c.psl 

Fig. A.ll. Figure lATI continued. 

for this figure see http://adlibitum.oat.ts.astro.it/tozzi/spectral-CDFS/paper_lMsp ec.psl 

Fig. A.13. Figure ETTI continued. 

for this figure see .http://adlibitum.oat.ts.astro.it/tozzi/spectral-CDFS/paper-lMspec.ps^ 

Fig. A.14. Figure lATI continued. 

Appendix B: Selection of Compton-thick candidates: spectral simulations 

We describe here the strategy we adopted in order to select Compton-thick candidates on the ba- 
sis of the X-ray spectrum. We also want to evaluate the efficiency of our method, and keep con- 
trol on the fraction of spurious candidates. First, we select a subsample of 110 sources choosen 
among the 321 sources of the sample because of their flat spectrum, with best-fit slope F < 1 
when fitted with a simple power law without absorption. This subsample is expected to include 



for this figure see |http://adlibitum.oat.ts.astro.it/ tozzi/spectral-CDFS/paper-l Mspec.ps| 
Fig. A.15. Figure lATI continued. 
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for this figure see"http://adlibitum.oat. ts.astro.it/tozzi/spectraLCDFS/paper _lMspec.ps( 



Fig. A.16. Figure lATI continued. 

for this figure see http://adHbitum.oat.ts.astro.it/tozzi/spectral_CDFS/paper_lMspec.ps' 



Fig. A.17. Figure lXn continued. 

for this figure see http://adlibitum.oat.ts.astro.it/tozzi/spectral_CDFS/paper_lMspe c.psl 

Fig. A.18. Figure lATI continued. 

for this figure see http://adlibitum.oat.ts.astro.it/tozzi/spectral_CDFS/paper_lMspe c.psl 

Fig. A.19. Figure IaTTI continued. 

for this figure see http://adlibitum.oat.ts.astro.it/tozzi/spectral_CDFS/paper_lMsp ec.psl 

Fig. A.20. Figure ETTI continued. 

for this figure see http://adlibitum.oat.ts.astro.it/tozzi/spectral_CDFS/paper_lMspec.psl 



Fig. A.21. Unfolded spectra and best fit model for the 14 C-thick sources (fitted with a pexrav 
model). 
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for this figure see"littp://adlibitum.oat. ts.astro.it/tozzi/spectral_CDFS/paper _lMspec.ps( 



Fig.A.22. Unfolded spectra and best fit model for the 8 Soft-C sources (fitted with a pow + 
zwabs pow model). 

the most obscured component of the XRB (see Civano, Comastri & Brusa 2005). Therefore we 
assume that all the Compton thick sources are included in this subsample. 

We also assume for simplicity, that all the sources can be described by two possible spectral 
shape: an absorbed power law for Compton thin sources, and a pure reflection for Compton thick 
sources. With the command fakeit within XSPEC, we simulated 1000 sources with a pure cold 
reflection spectrum, pexrav in XSPEC, with F fixed to 1 .8 and all the other parameters set to the 
default values (Simulation 1). Each simulated source is assigned a redshift and a normalization 
according to the distribution of the redshifts and the net detected counts of the subsample of 
real sources. Then, we simulated another 1000 sources with an absorbed power law (zwabs pow 
model), with a similar redshift and net detected counts distributions (Simulation 2). In Figure lSTI 
we compare the redshift and net detected counts distributions of the simulated sources with that 
of the parent sample of real sources. The values of Nh for the sources simulated with the model 
zwabs pow, are consistently extracted from the distribution we found in the paper (but only for 
Nh < 10^"^ cm"^ to exclude Compton thick sources). 

Then, we analyzed the two sets of simulation both with the pexrav (appropriate only for 
Simulation 1) and zwabs pow (appropriate only for Simulation 2) model. We verified that in 
the the first case we succesfully recover the input values for the normaUzation of the pexrav 
spectra, and in the second case the input values for Nh, within the errors. Finally, we compute 
the difference between the Cash-statistics obtained with the zwabs pow model and that obtained 
with the pexrav model: AC = C^Kapow - Cpexrav The normalized distributions of the values of 
AC are shown in Figure iB^l We recall that we have two free parameters for the zwabs pow 
model and only one for the pexrav model; this explains why the distribution of Simulation 2 
has a much larger tail at negative values of AC. Our goal is to use these distributions to choose a 
fixed threshold AC that allows us to select Compton thick candidates among the parent sample. 
The optimal choice would minimize the number of Compton thin sources mistakenly included 
in the C-thick sample, at the same time recovering the largest fraction of the Compton thick 
population. 

The distribution of AC for Simulation 1 is skewed towards large positive values, as expected 
since the pexrav model is the correct one. The tail at low values of AC is a measure of how many 
Compton thick sources may be missed when choosing a fixed threshold in AC. The distribution 
of AC for Simulation 2 is centered around negative values, since here the pexrav model is not 
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appropriate. Therefore, the tail at high values of AC is a measure of how many sources with an 
actual zwabs pow spectrum are mistakenly selected as C-thick candidates for a fixed threshold 
in AC. 

The simplifying assumption that our subsample of real sources includes only C-thin and 
C-thick sources, reads Nwt - Nc-iiuck + Nc-iim where Niot is the total number of sources in the 
subsample (here 1 10). We collect Nc sources as C-thick candidates by selecting the sources from 
the parent sample for which AC > AC. The expected value for Nc is: 



where Fi and F2 are the probabilities that a C-thick source is correctly recovered, and that 
a C-thin source is mistakenly included among the C-thick candidates, respectively. We can esti- 
mate Fi and F2 by integrating the two distributions for AC > AC. 

Therefore, the actual fraction of the C-thick sources in the parent sample fcr = N c-thick IN tot 
can be estimated as: 



Our estimated fcT should not depend on AC if our initial assumption Ntot - N c-thick + Nc-tim 
is correct. However, we know that the picture may be complicated by the presence of sources with 
soft component, or the lack of a proper treatment of the Compton scattering when Nh approach 
the Compton thick value of 1 .5 x 10^"* cm"^ (which would require the use of the model plcabs). 
However, a more detailed treatmend would go beyond the scope of this Paper. We find that for 
AC > 1 the expected values for fcT ranges between 0.10 and 0.20 (see Figure 1531 . Note here 
that this fraction is computed among the parent sample of 1 10 sources, therefore it corresponds 
to a number between only 10 and 20 C-thick sources in the whole CDFS sample. 

Obviously, for higher values of AC, the quality of the C-cthick condidates sample is increas- 
ing, while the fraction of true C-thick sources actually recovered drops. Assuming fcT - 0.15, 
we plot in figure lBT^ the following quantities as a function of AC: the expected fraction of true 
C-thick sources among the candidates (fgood)', the expected fraction of spurious sources among 
the candidates (fsp)', the fraction of the total C-thick source population actually recovered (free)- 
We notice that for AC < 2 more than 50% of the total C-thick source population is recovered. 
Therefore we assume AC = 2. 

The number of C-thick candidates we find for AC = 2 is 14 (see text). The number of spu- 
rious sources among the C-thick candidates turns out to be still significant, between 3 and 5. To 
summarize, we demonstrated here that a selection of C-thick sources on the basis of the X-ray 
spectrum is feasible. We also show that we can quantify the completeness and the contamination 
of our C-thick candidate sample. We also notice that the level of contamination is not negligi- 
ble, pointing towards the need of a more sophisticated X-ray spectral analysis, including, for 
example, the systematic search for the Fe line expected more frequently in reflection-dominated 
spectra, or considering the presence of a soft component that can mimick a flat spectrum (see 



Nc = Nc-,hick * FI+ Nc-,hin * F2 , 



(B.l) 



fcT = {NclN.o, - F2)/(Fi - F2). 



(B.2) 
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Weaver et al. 1996). Overall, we believe that a refined version of this approach can constitute a 
valuable tool to look for Compton thick sources on the basis of the X-ray data only. 
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Fig. B.l. Normalized redshift distribution (upper panel) and net detected counts (lower panel) 
for the subsample of real sources with flat spectrum (continuous lines) and for sources simulated 
with a pexrav spectral model (dashed lines - Simulation 1) and with a zwabs pow spectral 
model (dotted lines - Simulation 2). 



Appendix C: Measures of Nh at high-z: simulations 

The best fit values of Nh may show a spurious trend with redshift, due to the increasing difliculty 
of measuring Nh when the rest-frame soft band, which is most sensible to the intrinsic absorp- 
tion, is shifted out of the Chandra energy range. What happens typically is that the error bars are 
so large that, while the upper limits to Nh increase with redshift, the best fit values may fluctuate 
at large positive values also when the source has a negligible intrinsic absorption. We take into 
account part of this effect by resampling the value of Nh according to the error bars, however 
since Nh is always defined as positive, it is hard to avoid an average trend of increasing intrinsic 
absorption with redshift. In particular, we asked ourselves if the absorbed BLAGN found at z > 2 
may be spurious (see §7). 
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Fig. B.2. Normalized distribution of AC for Simulation 1 (continuous line) and for Simulation 2 
(dashed line). The vertical dotted line correspond to the choosen threshold AC - 2. 

To investigate this effect, we performed three sets of simulations (100 spectra each) of sources 
with negligible absorption (equal to the galactic value 9 x 10'^ cm"^) with redshift distributed 
uniformly in the range z = - 4. The average number of net detected counts in each set of 
simulations is 490, 150 and 80. In Figure ICTI we show the best fit values of Nh (with 1 cr error 
bars) plotted versus redshift. We also plot separately the cases in which Nh > at more than 1 
cr. We notice that, while the upper limits increases following approximately the (1 + z)^ law, the 
number of false detections of a non-negligible intrinsic absorption (Nh > at more than 1 cr) is 
not increasing significantly with redshift. In addition, we find that at 2 cr confidence level, all the 
values of Nh are consistent with negligible absorption. 

Therefore, the sample of BLAGN we discuss is consistent with no absorption in average 
(since Nh - for all the BLAGN at 2 cr c.l.). However the number of sources with best fit 
value Nh > 10^^ cm"^ among the BLAGN is 7, and it is larger than that expected for negligible 
absorption on the basis of the simulations (which is about 3). We conclude that some of the high- 
z BLAGN do have absorption at the level of Nh ~ 10^' cm still consistent with that found in 
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Fig. B.3. Estimated value of the fraction of C-thick sources among the parent sample (110 
sources) as a function of AC according to eg. IB .21 

Type I AGN. Values typical of Type II AGN, larger than 10^^ cm"^, would have been detected, 
while values around 10^' cm"^ are compatible with what we found in the data (see Figure IC2t . 
This picture is confirmed by the analysis of the XMM data for these sources (see discussion in 
§7). 

To summarize, we find that the effect of an artificially increasing measure of intrinsic ab- 
sorption with redshift is under control in our sample. A complete removal of the spurious trend 
would need extensive simulations and several iterations, since the effect depends on the actual 
behaviour of Nh with redshift. Such an approach goes beyond the scope of this Paper, but it 
should be adopted for larger AGN samples. 
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Fig. B.4. Upper solid line: expected fraction of true C-thick sources among the candidates (fgood) 
versus AC; lower solid line: expected fraction of spurious sources among the candidates (fsp)', 
dashed line: fraction of the total C-thick source population actually recovered (free)- 
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Fig. C.l. Lef panels: Best fit values of Nh as a function of redshift for simulated sources with 
input values Nh = 9 x 10''' cmT^. The dashed line is 9 x 10'''(1 + zf" ■ Error bars correspond to 1 
cr. The typical net detected counts are 480, 150 and 80 from top to bottom. Right panels: values 
of Nh for which A^h > at 1 cr. 




Fig. C.2. Best fit values of Nh as a function of redshift for simulated sources with input values 
Nh = 5 X 10^^ cm"^ (left panel) and Nh - 10^' cm"^ (right panel). Error bars correspond to 1 cr. 
The typical net detected counts are 280 (left panel) and 470 (right panel). 



